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ABSTRACT 

Human genetic diseases have been successfully 
corrected by integration of functional copies of the 
defective genes into human cells, but in some cases 
integration of therapeutic vectors has activated 
proto-oncogenes and contributed to leukemia. For 
this reason, extensive efforts have focused on 
analyzing integration site populations from patient 
samples, but the most commonly used methods 
for recovering newly integrated DNA suffer from 
severe recovery biases. Here, we show that a new 
method based on phage Mu transposition in vitro 
allows convenient and consistent recovery of inte- 
gration site sequences in a form that can be 
analyzed directly using DNA barcoding and pyro- 
sequencing. The method also allows simple estima- 
tion of the relative abundance of gene-modified 
cells from human gene therapy subjects, which 
has previously been lacking but is crucial for detect- 
ing expansion of cell clones that may be a prelude to 
adverse events. 

INTRODUCTION 

Human gene therapy has been carried out successfully for 
several diseases (1-8), but adverse events have occurred in 
which subjects developed leukemia associated with 



insertion of therapeutic vectors near proto-oncogenes 
(3,6,9,10). For this reason, it is important to track the 
location and abundance of different integration sites in 
cells from gene therapy-treated subjects. Tracking integra- 
tion sites is also of interest in the use of transposons as 
insertional mutations in model organisms and in basic 
studies of integrating genomic parasites (11,12). 

In the previously used protocols for integration site 
recovery in gene therapy, gene-corrected cells were 
isolated from patients, genomic DNA was purified and 
samples were typically cleaved with restriction enzymes. 
The exposed DNA ends were then ligated to adaptor 
DNAs and samples amplified using PCR with one 
primer complementary to the adaptor and the other com- 
plementary to the vector DNA terminus. Sites of integra- 
tion were identified by sequencing PCR products from a 
primer bound to the vector DNA, so that the sequence 
read extended into the flanking human DNA (13,14). It 
was noticed, however, that integration sites were not 
equally recovered by different enzymes, pointing to a 
recovery bias for this method. Although sites were effi- 
ciently recovered when found near restriction cleavage 
sites, they were difficult to detect when not positioned op- 
timally (15,16). Moreover, implementing restriction 
enzyme-based methods is complex — restriction enzymes 
need to be identified, which do not cleave DNA within 
amplicons of interest, only enzymes lacking CpG di- 
nucleotides in their recognition sites can be used due to 
biased distributions of CpG in mammalian DNA, and 
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large amounts of genomic DNA are needed. Thus, there is 
intense interest in alternative approaches. 

Improved methods are starting to be described, but all 
are in early stages (15-18). Concerns with available 
approaches include efficiency of recovery of rare integra- 
tion sites (16), and the need for large amounts of genomic 
DNA for analysis (17,18), which is often not available in 
gene therapy applications. 

Here we report a method for recovering sites of inte- 
grated DNA using the bacterial transposase MuA to 
introduce adaptors into genomic DNA to allow PCR 
amplification. This method is quick and simple, avoids 
the bias associated with restriction enzymes, recovers in- 
tegration sites in a near random fashion, and provides a 
simple measure of cell clonal abundance. 

MATERIALS AND METHODS 

Cell lines and gene therapy patient samples 

293T and SupTl cells were infected in vitro using a VSV-G 
pseudotyped MLV or HIV vector produced by transfect- 
ing 293T cells. To generate HIV-based vectors, cells were 
transfected with the LTR-GFP cassette plasmid 
pl56RRLsin-PPTCMVGFPWPRE (19), the packaging 
construct pCMVdeltaR9 (20) and the vesicular stomatitis 
virus G-producing plasmid pMD.G. VSV-G pseudotyped 
MLV particles were produced using pMD.G but in com- 
bination with the MLV vector segment (pMX-eGFP) and 
packaging construct pCGP (pCGP, kindly provided by 
Paul Bates). Human gene therapy samples consisted of 
PBMCs ((3-thalassemia) or CD3+ cells (SCID-X1). 
Details of gene therapy samples are reported elsewhere 
(1,2,7,8). The production of induced pluripotent cells con- 
taining a defined number of integration sites is reported in 
(21). 

Mu-mediated integration site recovery 

A detailed protocol for preparing Mu reactions is avail- 
able in the Supplementary Report 1 MuSOP. Briefly, 
reaction buffer, oligonucleotide donor, target DNA and 
water are mixed on ice followed by addition of purified 
Mu transposase. Reactions are then incubated at 30°C for 
2-4 h after which 2 ul of the reaction is used as input for 
PCR. Nested PCR was carried out as described (22) except 
that extension temperatures during cycling were changed 
to 70° C for the first seven cycles followed by 67° C for the 
remaining 37 cycles for samples with HIV-derived vectors. 
For both HIV- and MLV-based reactions, the number of 
cycles was reduced to 25 for nested PCR. Single round 
PCR amplification was performed with primers normally 
used in the nested round of PCR, each primer encoding 
454 adaptor sequences and the LTR primer encoding a 
DNA barcode (15). 

Integration site sequencing and analysis 

Integration sites were sequenced using 454 FLX platform 
technology. Sequences reads were trimmed to remove pri- 
mer sequence and aligned to the human genome (hgl8, 
version 36.1) using BLAT. Sequences were required to 



have a single best hit with >98% identity to the human 
genome, to align within 3 bp of the beginning of the se- 
quence read and to contain a perfect match to the expected 
LTR sequence lying downstream of the barcoded LTR 
primer. Comparisons to genomic features were carried 
out as described previously (23,24) using a combination 
of logistic regression and Bayesian model averaging 
supplemented by the random Forest machine learning al- 
gorithm. Gene expression analyses were based on data 
from 293T cells (25) with expression measured using the 
Affymetrix HU133 plus 2.0 gene chip array. Measuring 
sequence conservation at the site of restriction enzyme 
or Mu cleavage was performed using weblogo [http:// 
weblogo.berkeley.edu/ and (26)]. Statistical details for 
generating Lorentz curves and recovery probability plots 
can be found in Supplementary Report 2 Mu Recovery. 
Sequence data from this study was submitted to Genbank 
under accession numbers HR819863-HR863973. 



RESULTS 

We first quantified biases in the detection of integrated 
DNA using protocols relying on restriction enzyme cleav- 
age of genomic DNA. Analysis of the recoverability of an 
integration site based on its proximity to the nearest re- 
striction site showed that integrated DNA ~49bp from 
a cleavage site was recovered most frequently, and fre- 
quency of recovery decreased sharply at longer or 
shorter distances (Figure 1A). The case of SCID-X1 
patient 10 provides an example of complications due to 
this recovery bias (3,7). In this patient, an integrated 
vector activated expression of the nearby BMI1 
proto-oncogene, which was associated with massive ex- 
pansion of leukemic cells. When DNA from blood cells 
of patient 10 was cleaved and analyzed using four different 
restriction enzymes, only two enzymes allowed efficient 
recovery of the BMI1 integration site (Figure IB). In an 
extreme effort to circumvent these limitations, a study of 
SCID-X1 gene-corrected patients used up to six different 
restriction enzymes to analyze each individual sample, but 
even with the difficulty and expense of this large scale 
effort, recovery was still significantly biased (7). 

The improved method reported here (Figure 1C and D) 
substitutes MuA-directed transposition in vitro for restric- 
tion enzyme cleavage and adaptor ligation [for background 
and reviews of MuA function see (1 1,27-33)]. An engineered 
transposon end is used as donor (Figure 1C), which 
contains (i) binding sites for the MuA transposase; 
(ii) an adaptor region complementary to PCR primers; 
and (iii) an amine blocking group at one DNA 3'-end. 
In the presence of MuA transposase, the oligonucleotide 
donors become covalently joined to target DNA, allowing 
convenient installation of primer sites in human genomic 
DNA. PCR amplification is then carried out using primers 
complementary to the vector DNA end and the adaptor. 
Because the adaptor contains a 5' overhang and an amino- 
modified 3'-end, amplification must begin within the 
vector DNA and extend through the adaptor, preventing 
adaptor-to-adaptor amplification. As little as lOOng of 
genomic target DNA can be used. 
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Figure 1. The Mu-based integration site recovery method. (A) Severe recovery biases in previous methods using restriction enzyme cleavage. A large 
collection of integration sites generated from SCID-X1 gene therapy (Supplementary Table SI) were analyzed and plotted to show the relative 
recovery frequency for different distances between the restriction enzyme site used in genomic DNA cleavage and the vector integration site. The 
graph summarizes data obtained using six different restriction enzymes. The sharp peak documents the recovery bias (the location of the peak 
differed modestly for the different restriction enzymes studied; data not shown). (B) Biased recovery efficiency for four restriction enzymes in a 
sample from an adverse event. The integration site within BMI1 was implicated in an adverse event in SCID-X1 patient 10 (3). Each bar indicates the 
percent of all integration sites from the leukemic cell sample deriving from the BMI1 site for each of the three restriction enzymes or three 6-cutter 
cocktail (Avr I, Spe I and Nhe I) used for isolation. (C) The engineered Mu DNA donor used in these studies. 5' and 3' DNA ends are as marked. 
The 'N' indicates the position of an amino-modifier that blocks the DNA 3'-end to prevent adaptor-to-adaptor amplification. The dark blue indicates 
binding sites for MuA transposase, light blue a spacer region and green the adaptor sequence for PCR amplification. (D) The Mu-mediated 
integration site recovery method. MuA transposition is used to install the engineered Mu DNA donor (top), allowing PCR amplification 
(middle). The PCR primers contain DNA barcodes (black segments) and primers for use in 454/Roche pyrosequencing (orange). PCR products 
can be used directly for pyrosequencing without cloning in bacterial plasmids. 



Nested PCR is routinely carried out using the restriction 
enzyme method, and can be used with the Mu-mediated 
method. However, tests showed that for samples contain- 
ing relatively large numbers of integrated vectors, only a 
single round of PCR was sufficient to yield high quality 
sequence populations using the Mu-mediated method, and 
these data sets actually showed improved diversity 
(described in Supplementary Report 1 'Mu Standard 
Operating Procedure'). 

The PCR primers used for the final amplification step 
are composites, containing both the priming sequences 
and sequences required for 454/Roche pyrosequencing. 
Primer sequences also contain a DNA barcode between 
the 454 sequence and the priming region, allowing large 
numbers of amplicons to be pooled, sequences deter- 
mined, then sequence reads parsed using the barcodes 
(Figure ID) (34-36). Thus, hundreds of samples can be 
processed in pools. Pyrosequence reads are then trimmed, 
aligned to the human genome and distributions analyzed. 

To test the Mu-mediated method, we determined 25 194 
total integration site sequences, which yielded 3382 unique 
vector integration sites after condensing duplicates. We 



analyzed both HIV- and y-retrovirus-based vectors. 
Samples studied included cells from patients in two gene 
therapy trials, which treated SCID-X1 (3) and 
p-thalassemia (8) and tissue culture cells infected in vitro 
(summarized in Supplementary Table SI). 

We first compared recovery biases of the Mu and re- 
striction enzyme-based methods. We determined the 
sequence preferences for Mu integration in human DNA 
in vitro for 5968 integration site sequence reads that 
included the Mu-end oligonucleotide DNA. Alignment 
of human sequences at Mu integration sites revealed a 
detectable consensus sequence closely resembling that 
reported previously for Mu transposition (37), but with 
much lower information content than cleavage sites for re- 
striction enzymes (Figure 2A and B and Supplementary 
Report 2), indicating less bias in the cleaving/joining 
reactions. 

The performance of the restriction enzyme method was 
next compared with the Mu-mediated method by measur- 
ing recovery biases for each method, then annotating the 
human genome for calculated recovery rates based on the 
data. Thus, for any integration site in the human genome, 
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the likelihood of recovery at each base pair could be 
calculated for each recovery method (Supplementary 
Report 2). 

Figure 3A illustrates the relative recovery frequencies 
using the LM02 promoter as an example, which was 
chosen because the gene has been involved in several 
adverse events in SCID-X1 gene therapy (3,38). Note 
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Figure 2. Consensus sequences at points of adaptor addition. 
(A) Information content at junctions between the engineered Mu 
DNA and human DNA derived from vector integration site sequence 
reads. The .x-axis shows the DNA sequence position, where the site of 
joining to Mu DNA is between positions 10 and 11 (arrow). Perfect 
sequence conservation has information content of 2 bits (j-axis). Note 
that some bases have little or no information content, so no letters are 
visible. (B) Information content at adaptor junctions from vector inte- 
gration sites recovered after cleavage with the restriction enzyme Mse I, 
where the site of cleavage is between positions 10 and 11 (arrow). 



that although Figure 3A summarizes results on a single 
region of the genome, the biases were measured genome- 
wide from integration site data for each method and are 
shown at LM02 for purposes of illustration. Recovery 
was relatively consistent for the Mu-mediated method 
over all sites, whereas the restriction enzyme methods 
show sharp peaks and valleys, where valleys indicate lo- 
cations where an integration event would be difficult or 
impossible to isolate and the peaks frequently recovered 
sites that would mask more rare sites. Figure 3B shows the 
data plotted as the cumulative recovery frequency, where 
perfectly unbiased recovery would be indicated by a curve 
that followed the diagonal from lower left to upper right. 
The Mu-mediated recovery method most closely ap- 
proaches the diagonal and is significantly closer than even 
the method using six restriction enzymes (P< 0.001; see 
Supplementary Report 2 for statistical methods), docu- 
menting that the Mu-mediated method is the least biased. 

Another means of quantifying recovery biases involves 
comparing the increased sampling effort required to reach 
the results of a perfectly unbiased method. The slight 
biases introduced by the Mu method would require only 
a 10% increase in sampling effort to achieve the efficiency 
of a perfectly unbiased method (Supplementary Report 2). 
In contrast, for some restriction enzyme methods, the 
needed increase is too large to measure accurately 
(> 50-fold). For the pool of all six restriction enzymes, a 
45% increase in effort would be needed. Thus the Mu- 
mediated method yields less biased recovery with much 
less effort than any form of restriction enzyme-based 
method. 

We next investigated the experimental effort required to 
recover all members of a fully defined integration site 
population using the Mu-based method. We prepared an 
induced pluripotent stem (iPS) cell line in which all cells 
contained the same six known lentiviral vector integration 
sites. We carried out 12 independent integration site recov- 
ery reactions using the Mu-mediated method, recovering 
an average of 1068 sequence reads per replicate. We found 
that six of the 12 replicate reactions recovered all six sites. 
The replicates with fewer than six yielded either four or 
five of the sites. To assess the sampling effort required for 
recovery of all six sites, we pooled different numbers of 
Mu reactions computationally and assessed recovery. For 
pools of three of the 12 Mu reactions, 98% contained all 
six sites. For pools of five Mu reactions, 100% of the pools 
contained all six sites. Thus, 3-5 independent Mu reac- 
tions are enough to completely sample an integration 
site population of this size at the sequencing depth used. 
For comparison, three restriction enzymes were tested for 
recovery of the six sites, and only one yielded all six. 

The genome-wide distribution patterns of integration 
target sites for HIV and y-retroviruses have been studied 
extensively (13,14,23,39,40), allowing us to assess whether 
the Mu-mediated method reported similar trends. For 
studies of integration frequency near genomic landmarks, 
restriction enzyme-based methods usually provided an 
adequate overview, because the restriction biases are 
only weakly related to those landmarks. Figure 4A and 
B compares the distributions of integration sites isolated 
using the two methods for HIV and y-retrovirus-based 
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Figure 3. Reduced recovery bias using the Mu-mediated integration site isolation method. (A) Relative recovery rates compared at the LM02 
promoter for (i) the Mu-mediated method; (ii) six tests with single restrictions enzymes or pools or (iii) a mixture of all six. The recovery rates were 
calculated from data on the placement of integration sites relative to restriction enzyme cleavage sites or Mu transposase sites used in their isolation, 
using the data in Supplementary Table SI and statistical methods described in Supplementary Report 2. Calculated integration site recovery rates 
were then used to annotate each base over lOkb at the LM02 promoter (chrll, bases 33770412-33780411). 'Six enzymes' indicates pooled data for 
the six sets below. For a perfectly unbiased method, all such rates equal 1.0. (B) Statistical analysis of biases in recovery of integration sites. The 
.v-axis plots each base of the LM02 promoter analyzed above, treating each as a potential integration target. Sites were ranked by expected ease of 
isolation, with the easiest to isolate to the right. The y-axis shows the calculated proportion of sequences recovered given the measured recovery 
biases. Perfect unbiased recovery would follow a line from lower left to upper right. Statistical methods and P values for pair-wise comparisons are 
summarized in Supplementary Report 2. 



vectors. Integration sites are compared from infections of 
tissue culture cells and for one SCID-X1 patient (3,7). 
Figure 4A summarizes the relationship of integration 
sites to genomic features, using a heat map format to 
indicate increased or decreased integration frequency 
compared with random distributions. For both HIV and 
y-retroviral vectors, the genome-wide trends were closely 
similar for the Mu-mediated or restriction 
enzyme-mediated methods. Both HIV-based and 
y-retrovirus-based vectors favor integration in regions of 
high gene density and associated genomic landmarks. 
Gamma-retroviruses favor integration near gene 5'-ends. 
HIV and y-retroviruses show a complex pattern of favored 
and disfavored integration sites near regions of histone 
methylation, acetylation and bound proteins, as indicated 
by comparison to data from ChlP-seq experiments (41- 
44) (Figure 4B), again showing favored integration near 
marks of active transcription. These patterns matched 
closely for the Mu-mediated and restriction-enzyme- 
based methods. The genome-wide patterns for SCID-X1 
gene therapy closely paralleled those seen for y-retroviral 
vectors, as reported previously (3,6,7). Thus, conclusions 
on genome-wide distributions of integration sites from the 
Mu-mediated method parallel those from extensive 
previous studies. 

The Mu-mediated method also allows a new approach 
to quantifying the relative frequency of gene-corrected 
cells in patient samples. Gene corrected cell clones that 
are present in many copies will contribute a relatively 



larger proportion of their integration site DNA to the 
genomic DNA pool after purification, providing an incr- 
eased number of target sites for Mu integration in vitro. As 
a result, relatively larger numbers of independent Mu 
integration events will lead to recovery of the same high 
abundance vector integration site. Quantifying the number 
of independent Mu integration sites per vector integration 
site therefore provides a simple measure of the relative 
abundance. 

We tested this in a sample from a P-thalassemia gene 
therapy trial, in which a cell harbouring a single vector 
integration site in the HMGA2 gene expanded to comprise 
more than one-third of the gene-corrected peripheral 
blood mononuclear cells (PBMC) population, as docu- 
mented by quantitative PCR assays (8). We recovered 10 
independent Mu integration sites for the HMGA2 vector 
integration site, while all other vector integration sites 
were recovered with only a single Mu site (Figure 4C). 
This finding makes the important clinical point that the 
HMGA2 site is the only high abundance integration site in 
the expanded cell clone. Using the restriction enzyme 
cleavage method for integration site recovery, only two 
out of three of the enzymes used allowed isolation of the 
HMGA2 site (data not shown). 

Lastly, we present a method for suppressing PCR con- 
tamination. In assessing collections of gene therapy sam- 
ples, multiple tubes are commonly processed in parallel 
using nested PCR, providing an ideal setting for migration 
of PCR products between samples. In practice controlling 
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Figure 4. Mu-mediated integration site recovery. (A) Integration frequency near genomic land marks. (B) Integration frequency near sites of histone 
methylation, acetylation or bound chromosomal proteins. Data sets compared are indicated by the column headings, features analyzed by the rows. 
Heat maps were constructed using the receiver operating characteristic (ROC) area method to compare the observed distributions to random 
distributions of integration sites (23,45). ROC areas of 0.5 indicated integration sites are present near the indicated genomic features as often as 
expected by chance. ROC areas >0.5 indicate positive association and areas <0.5 negative association. Associations are colour coded as indicated by 
the key at the bottom of each map. In (A), for the Gene Density, Expression Density and GC content measures, several different length genomic 
intervals were used for comparisons, which are indicated by numbers to the right of the black bar. In (B), ChlP-seq analysis was used to map the 
genome-wide distributions of sites of histone post-translational methylation or acetylation, or bound DNA binding proteins (41-44), and the results 
compared with integration site distributions. Statistical analysis shows that most associations where discernable colour can be seen in a heat-map tile 
achieve statistical significance. (C) Comparisons of the numbers and positions of Mu integration sites that allowed recovery of the vector integration 
sites at HMGA2, VPS13B and POLA2 generated during gene therapy for p-thalassemia. 



contamination is challenging even for experienced experi- 
mentalists. We have devised a method that suppresses this, 
in which separate adaptors are used for each sample, so 
that contaminating PCR products are not amplifiable 
outside the correct PCR reaction. In reconstruction ex- 
periments, this has been effective at suppressing cross-over 
in our laboratory (data not shown). Use of multiple 
adaptors is described in the Standard Operating 
Procedure in the Supplementary Data for this article. 

DISCUSSION 

In summary, the Mu-mediated integration site recovery 
method allows simplified recovery of integration sites 
and estimation of relative abundance. Use of protocols 
based on single restriction enzymes results in failure to 
isolate integration sites that are not near restriction 
enzyme recognition sites (7,15,16), which can be crucial 



in monitoring adverse events during gene therapy. The 
challenges posed by biased isolation have been under- 
estimated in some of the early literature in this field. 
One previous study attempted to circumvent recovery 
biases by using six restriction enzymes to study single 
samples, but even with this added complication and 
expense, recovery using this method is still more biased 
than with the Mu-based method (Figure 3B; P< 0.001). 
Thus in cases where it is critical to use integration site 
data to identify expanded cell clones, the Mu method is 
attractive and convenient. 

Methods for deep sequencing are in a state of rapid tran- 
sition. The Mu-mediated method described here can in 
principal be adapted to any of the next generation se- 
quencing platforms. At this writing, the Solexa/Illumina 
method is least expensive per base, but potentially incon- 
venient because analysis of a handful of samples will often 
take up only a fraction of a run, requiring complicated 
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coordination with others to fill out a run. The 454/Roche 
method is more expensive per base, but the availability of 
a benchtop instrument for smaller runs (the 'Junior' in- 
strument http://www.gsjunior.com/) simplifies through- 
put. It is highly likely that additional sequencing 
methods will become available in the near future that 
may also be useable with the Mu-mediated method. 

The demands placed on integration site recovery tech- 
nology vary by disease state. The frequency of gene cor- 
rected cells varies from 100% (T cells in SCID-X1) to 
<1% [early adenosine deaminase (ADA) studies], so the 
demands on the technology differ for different diseases. 
For very low level clones, recovering integration sites 
and estimating their abundance is at present challenging 
for any technology. 

Additional methods are starting to be proposed for in- 
tegration site analysis, including methods based on DNA 
shearing (N. Gillet, N. Malani, N. Gormley, R. Carter, A. 
Melamed, D. Bentley, C. Berry, F. Bushman, G. Taylor 
and C. Bangham, submitted for publication) or limited 
extension from integrated vectors with a DNA polymerase 
followed by RNA ligation ('nrPCR') (16). Each of these 
methods is of interest but each may have inefficient steps. 
For the Mu-mediated method, it can be challenging to 
obtain enough Mu integration events to query the full 
human genome efficiently, though the method is suitable 
for analysis of large numbers of samples with small 
amounts of starting genomic DNA. Ongoing use has 
shown the method to be effective in practice. For DNA 
shearing, it can be challenging to obtain efficient ligation 
after repairing broken DNA ends, and to work with small 
amounts of DNA. For the nrPCR method, efficiency may 
be an issue (16). 

Two reports in the peer-reviewed literature document 
the utility of the Mu-mediated method. In one case, 
oligoclonal reconstitution during lentiviral vector- 
mediated gene correction was documented using 
Mu-mediated recovery of integration sites during 
(3-thalassemia gene correction in mice (46). In this case, 
the inferred rank order of integration site abundance from 
sequence read counts was similar to that inferred by quan- 
tifying the number of independent Mu-transposition 
events in vitro that led to site recovery. The congruence 
of these two measures supports the idea that quantifica- 
tion relative to clonal abundance was consistent. In the 
second study (21) the Mu-mediated method was used 
along with other methods and found to be comparably 
efficient. Long term, it will be useful to compare the ef- 
fectiveness and convenience of present and future methods 
for quantifying integration site abundance on defined in- 
tegration site populations. 
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